Securing Interpretability: The Case of Ega Language Documentation

Authors

  • Dafydd Gibbon
  • Catherine Bow
  • Steven Bird
  • Baden Hughes
Abstract

The prime consideration in designing sustainable language resources is to ensure that they remain interpretable for coming generations of users. In this paper we adopt a new perspective on resource creation, that of securing the interpretability of data, using a case study of Ega, an endangered African language for which a small amount of legacy data is available. Basic steps towards securing interpretability are to transfer files to durable media and, where possible, to convert all legacy data into XML files with Unicode character encodings. In the absence of agreed ‘best practice’ standards, we propose a methodology of ‘better practice’ to assist in the transition process towards this goal. We discuss a number of issues involved in securing interpretability of the lexicon, character encodings, interlinear glossed text, annotated recordings and nomenclature in linguistic descriptions, and describe our solutions.

Similar resources

SECURING INTERPRETABILITY OF FUZZY MODELS FOR MODELING NONLINEAR MIMO SYSTEMS USING A HYBRID OF EVOLUTIONARY ALGORITHMS

In this study, a Multi-Objective Genetic Algorithm (MOGA) is utilized to extract interpretable and compact fuzzy rule bases for modeling nonlinear Multi-input Multi-output (MIMO) systems. In the process of nonlinear system identification, structure selection, parameter estimation, model performance and model validation are important objectives. Furthermore, securing low-level and high-level ...

Use of the extreme groups approach: a critical reexamination and new recommendations.

Analysis of continuous variables sometimes proceeds by selecting individuals on the basis of extreme scores of a sample distribution and submitting only those extreme scores to further analysis. This sampling method is known as the extreme groups approach (EGA). EGA is often used to achieve greater statistical power in subsequent hypothesis tests. However, there are several largely unrecognized...

Impact of Controlled and Free Language Use in Retrieving Articles from the ProQuest and Science Direct Databases

Abstract Introduction: The growth and expansion of the Internet has changed the way information is accessed and many facilities have been created on the Web to facilitate and expedite information locating. Objective: To identify the impact of keyword documentation using the medical thesaurus on the retrieval of articles from ProQuest and Science Direct databases. Materials and Methods: The pr...

Acquisition of English Relative Clauses by Adult Persian Learners: Focus on Resumptive Pronouns

Tsimpli and Dimitrakopoulou (2007) observed that uninterpretable features are unavailable in second language (L2) acquisition after the critical period. In this paper, we verify this claim by providing evidence from Persian-speaking learners of English as an L2 on the status of resumptive pronouns (RPs) as uninterpretable features. Unlike English, which does not allow RPs, Persian shows various b...

The Acquisition of Definiteness Feature by Persian L2 Learners of English

The definiteness feature in English is both LF- and PF-interpretable, while Persian is a language in which this feature is LF-interpretable but PF-uninterpretable. Hence, there is no overt article or morphological inflection in Persian denoting a definite context. Furthermore, Persian partially encodes specificity, not definiteness. In definiteness both the speaker and hearer are involved while in...

Publication date: 2004